Dissertation Spatial Hearing, Auditory Sensitivity, and Pattern Recognition in Noisy Environments
نویسنده
چکیده
This dissertation investigates how human perceivers and learning algorithms cope with noisy, complex environments. The first part of the dissertation presents results of psychoacoustic experiments that give insight into human auditory perception of sounds originating near the listener. The second part develops a memory-based computational learning architecture that is resistant to noise while limiting its own internal complexity. The first part of the dissertation describes results of two psychoacoustic experiments. The first experiment investigates the ability of human listeners to employ binaural auditory processing when detecting a sound from various positions around the listener. The experiment is performed in a simulated noisy anechoic environment, which provides more realistic spatial cues than have been tested in previous headphone studies of masked sound detection while allowing the acoustic signals to be fully characterized. The results support the idea that performance is determined by brainstem auditory processing and is not influenced by any higher-level sound-location-based processing. The second psychoacoustic experiment studies how the listener’s position in a room and the listener’s experience in a room influence the ability to accurately localize nearby sounds in azimuth and distance. The results show that the listener position in the room influences response variability and that as a listener becomes familiar with a room, passive learning occurs, reducing response variability. Finally, the presence of a nearby wall has a small influence on azimuthal response bias, but no noticeable effect on perceived source distance. In the second part of the dissertation, a memory-based learning system, called PointMap, is proposed. This system implements new methods of pruning based on informative value of the stored memories. It is shown that such a “forgetting” mechanism can solve the problems of category proliferation and noise sensitivity in many learning systems. Variations on the Adaptive Resonance Theory systems are also investigated. Both portions of this dissertation demonstrate that learning enables humans and machines to perform robust pattern recognition tasks in the face of noise and uncertainty in the inputs.
منابع مشابه
The effects of traffic noise on memory and auditory-verbal learning in Persian language children
Background: Acoustic noise is one of the universal pollutants of modern society. Although the high level of noise adverse effects on human hearing has been known for many years, non-auditory effects of noise such as effects on cognition, learning, memory and reading, especially on children, have been less considered. Factors which have negative impact on these features can also have a negat...
متن کاملIntegrating Adaptive Beam-forming and Auditory Features for Robust Large Vocabulary Speech Recognition
We demonstrate a system to integrate adaptive beam-forming and auditory features in order to improve speech recognition accuracy in noisy environments. Adaptive beam-forming based on a microphone array can utilize spatial information to improve the sound recording signal-to-noise ratio (SNR) on a focused speaker for robust speech recognition. Auditory features based on modeling the signal proce...
متن کاملValidity and reliability of the Persian version of spatial hearing questionnaire
Background: Our hearing ability in space is critical for hearing speech in noisy environment and localization. The Spatial Hearing Questionnaire (SHQ) has been devised to focus only on spatial haring tasks (e.g., lateralization, distance detection and binaural detection). The aim of the present study was to determine the reliability and validity of the Persian translation of the SHQ (Spatial ...
متن کاملSpeech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملIntegrating Monaural and Binaural Cues for Sound Localization and Segregation in Reverberant Environments
The problem of segregating a sound source of interest from an acoustic background has been extensively studied due to applications in hearing prostheses, robust speech/speaker recognition and audio information retrieval. Computational auditory scene analysis (CASA) approaches the segregation problem by utilizing grouping cues involved in the perceptual organization of sound by human listeners. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002